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METHOD AND APPARATUS FOR ESTABLISHING A BOUND ON THE EFFECT 
OF TASK INTERFERENCE IN A CACHE MEMORY 

Field of the Invention 

The present invention relates generally to methods and apparatus for evaluating 
software, and more particularly, to methods and apparatus for evaluating the impact of task 
interference on the execution time of a primary task that may be interrupted by one or more 
secondary tasks. 



10 Background of the Invention 

Processors often use a cache to improve performance and decrease system costs, 
hi Caches temporarily store recently accessed information (blocks of instructions or data) in a small 
S memory that is faster to access than a larger main memory. Caches are effective because a block 
® that has been accessed once is likely to be accessed soon again or is often near a recently 

sssi? 

W accessed block. Thus, as a task executes, the working set of a task (the instructions and data 
3 currently required for the task) is stored in the cache in the event that the information may be 
p 5 accessed again. A cache typically maps multiple blocks of information from the main memory 
D 1 into one place in a cache, typically referred to as a "set." A "block" refers to the minimum unit 
O of information that can be present in a cache and a "frame" is the place in a cache where a single 
|! block may be stored. In a set associative cache, multiple frames are grouped into sets. For 
example, a two-way set associative cache has two frames in each set. 

In many embedded applications, a primary task may be interrupted by one or more 
secondary tasks. Thus, following an interruption, as a secondary, interrupting task executes, the 
working set of the interrupting task is stored in the cache, potentially evicting the working set of 
25 the primary, interrupted task and thereby decreasing the performance of the primary, interrupted 
task when it resumes execution. When the primary, interrupted task resumes execution, portions 
of the working set that have been evicted from the cache must be obtained from main memory, 
causing "cache misses." Thus, the execution time of the primary, interrupted task is extended by 
the time taken to run the secondary task plus the miss penalty due to obtaining evicted portions of 
30 the cached information from the main memory. 
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A critical parameter of a real-time task is its maximum response time over all 
possible inputs. In some systems, a task scheduler allocates a processor's cycles among multiple 
tasks to meet their response time requirements. Thus, the worst-case execution time of each task 
must be known. When instruction timings or execution paths are uncertain, conservative (worst 
case) assumptions are often made that may waste system capability or lead to an unnecessarily 
costly system. If the resulting worst-case time-bound is loose, a task is allocated more execution 
time than it can possibly use, wasting system capability and performance. Efficient system design 
thus requires methods to tightly estimate the effect of complex cache behavior. A need therefore 
exists for methods and apparatus that evaluate the additional execution time of the primary, 
interrupted task attributed to any interrupts. A further need exists for methods and apparatus that 
establish a bound on the effect of task interference in an instruction cache shared by multiple 
tasks. 

Summary of the Invention 

Generally, a method and apparatus are disclosed for establishing a bound on the 
effect of task interference in an instruction cache shared by multiple tasks. The bound 
established by the present invention is the maximum number of "live" frames of a given task that 
are coexistent during the execution of an application. As used herein, a "live cache frame" is a 
cache frame that contains a block that is accessed in the future without an intervening eviction. 
The present invention recognizes that the eviction of blocks from a live frame by an interrupt 
causes a future miss that would not otherwise occur and that evictions from live frames are the 
only evictions that cause misses that would not otherwise occur. 

The present invention thus provides a more accurate estimate of the maximum 
additional execution time of a task that results from servicing an interrupt during its execution. 
The maximum additional execution time is attributed to three components, namely, the execution 
time of the interrupting task, the time to obtain and replace the blocks that are evicted from the 
cache by the blocks of the interrupting task and the consequences of altering the least recently 
used status in the cache. 

The maximum number of live frames of a given task that are coexistent during the 
execution of an application can be generated, for example, by analyzing a time-stamped log of 
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accesses to the cache to determine which frames are "live" at a given time. In a further 
variation, an application may be simulated for a period of time during which a candidate 
maximum is continually calculated and updated as required. Generally, a greater candidate 
maximum replaces a lesser candidate maximum during simulation. An algorithm can be 
5 generated to determine the maximum number of coexistent live frames experienced during a 
simulation and the point at which the maximum occurs upon completion of a simulation. 

The general approach of the present invention assumes that an interrupt evicts the 
blocks in all of the live frames in a cache during its execution and is independent of the way 
interrupting tasks map to frames in an instruction cache. If the blocks of an interrupting task do 
10 not map to a set that contains a live application frame, then the contents of the live application 
frame cannot be evicted by the interrupt. Thus, in further variations of the present invention, 
additional improvements are obtained by exploiting knowledge of the character of an intervening 
task to achieve a tighter bound when possible. In one variation, the sets that contain an 
application's live frames are compared with the sets accessed by an interrupt to determine the 
maximum number of live-sets that may be accessed by an interrupt. Another variation exploits 
the fact that only an interrupt' s accesses to the application's frames that are live at the interrupt 
S point may cause future misses. Thus, the effect of an interrupt at each possible interrupt point is 
W determined and the maximum of these becomes the worst-case bound. 

g A more complete understanding of the present invention, as well as further 

% features and advantages of the present invention, will be obtained by reference to the following 
detailed description and drawings. 

Brief Description of the Drawings 

FIG. 1 illustrates a conventional scheme for addressing a typical cache; 
25 FIG. 2 illustrates the effects of a context switch in an instruction cache; 

FIG. 3 illustrates a set in a two-way set-associative cache where an interrupt by an 
intervening task alters the least recently used status of the set; 

FIG. 4 illustrates the liveliness state of a cache frame in accordance with the 
present invention during execution of a task; 
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FIGS. 5 A and 5B illustrate the distribution of live sets and frames in a cache 
during the execution of an exemplary adaptive multi-rate voice coder/decoder application; 

FIG. 6 illustrates a variation of the present invention where the sets that contain an 
application's live frames are shown with the sets accessed by an interrupt to determine the 
5 maximum number of live-frames that may be affected by an interrupt; 

FIG. 7 illustrates a set vector that represents the sets accessed by an interrupt over 

its entire time; 

FIG. 8 illustrates a technique for determining the worst-case intersection between 
a set vector and a profile of an application's live set; and 
10 FIG. 9 is a schematic block diagram showing the architecture of an exemplary 

software developer's computer incorporating features of the present invention. 

5; Detailed Description 

H The present invention provides techniques for establishing a bound on task 

O interference in real-time systems that employ an instruction cache shared by multiple tasks. 
^ Specifically, the present invention estimates the additional execution time of a task that results 
Cl from servicing an interrupt during its execution. The execution time of a task is extended by an 
fil interrupt due to three components, namely, the execution time of the interrupting task, the time to 
S obtain and replace the blocks that are evicted from the cache by the blocks of the interrupting 
% task and the consequences of altering the least recently used status in the cache. 

The bound established by the present invention is the maximum number of live 
frames that are coexistent during the execution of an application. As used herein, a 'live cache 
frame" is a cache frame that contains a block that is accessed in the future without an intervening 
eviction. The present invention recognizes that the eviction of a task's block from a live frame 
25 by an interrupt causes a future miss that would not otherwise occur and that evictions from live 
frames are the only evictions that cause misses that would not otherwise occur. Thus, only the 
evictions of live cache frames cause future misses that would not otherwise occur. The 
maximum number of live frames coexistent during the execution of an application bound such 
misses, independent of the intervening tasks. In further variations of the present invention, 
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knowledge of the character of an intervening task is exploited to achieve a tighter bound when 
possible. 

FIG. 1 illustrates a conventional scheme for addressing a cache 100. As shown in 
FIG. 1, the exemplary cache 100 is a two-way set associative cache (two frames per set). A 
5 portion of the address 110 of a block of main memory is a set index value identifying the 
corresponding set in the cache 100. While the present invention may be incorporated into all 
instruction cache organizations, the present invention is illustrated with a two-way set associative 
instruction cache that has two frames at each set address. 

Any frame in a set may hold a block from main memory that maps to the set. A 
10 newly accessed block is usually stored in the set's least recently used (lru) frame and evicts its 
contents if occupied. An evicted block, because it is the least recently used block, is less likely to 
O be accessed in the near future than those in other frames of the set Least recently used block 
X information is associated with each set for this purpose and is updated during each access. 
**\ The cache 100 is accessed with the address 1 10 of the requested block. The index 

S3 portion 120 is used to access an entry (tag) in a table (directory) 140 that contains the address of 
r the block. Each entry contains the address of the block (tag) that is in the corresponding frame of 
|j the set. The tags are compared using a comparator 150 with the address 110 of the requested 
W block to find the frame that holds the block, if any. A cache miss results if the block is not in any 
O of the frames. Since many blocks used by a program (or a set of programs) may map to the same 
% set (because their index is the same), they share the frames in the set and evict each other over 
time as necessary. 

As previously indicated, the present invention estimates the effect of an interrupt 
on the execution time of a task. Specifically, the present invention estimates the additional 
execution time of a task a that results from servicing an interrupt during its execution. Typically, 
25 a designer employing conventional techniques may be compelled to make one or more of the 
following conservative and simplifying assumptions: 

L Task a's blocks fill the entire cache at the time of an interrupt; 

2. All of task a's blocks are evicted during execution of an interrupt; or 

3. All evicted blocks are accessed by task a after it resumes execution. 
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In an exemplary two-way set associative cache having 128 sets, where a cache 
miss requires 12 cycles to service, then the completion of task a is delayed 3072 cycles (128 sets 
x 2 frames/set x 12 cycles/frame to service) beyond that consumed by the interrupts execution 
(which is assumed to be known). This result is multiplied by the maximum number of interrupts 
5 that may occur during the execution of task a. This conventional approach establishes a valid but 
potentially wasteful bound because many simplifying assumptions are made. A less wasteful 
analysis requires a more precise knowledge of task behavior in the cache. 

FIG. 2 illustrates the cache related effects of a context switch while a task (task a) 
is executing with other tasks that time-share an instruction cache. As shown in FIG. 2, the 
10 execution time of task a is extended by an interrupt via three effects, namely, (i) the interrupt' s 
execution time (labeled x in FIG. 2), (ii) the interrupt evicting task a's blocks (labeled y in FIG. 
q 2), and (iii) the consequences of altering the least recently used status in the cache (labeled z in 
FIG. 2). The first effect is due to the fact that while the interrupt is executing the interrupted task 
is not. The second effect is the additional time taken to fetch application blocks that are evicted 
% from the cache by the interrupt' s blocks. The potential effect of the perturbation on the least 

recently used status is subtle and discussed hereinafter. 
O FIG. 3 illustrates a set in a two-way set-associative cache where an interrupt by an 

m 

ft! intervening task alters the least recently used status of the set, thereby degrading the performance 
5j of a previously executing task when it resumes execution. Assume that an application (app) is 
% executing and one of its blocks occupies the left frame and that the least recently used frame is 
the right frame (empty) at time ti. An interrupt now occurs. A block of the interrupting task is 
inserted into the right frame during its execution (because it is the least recently used frame) at 
time t 2 . The left frame is now the least recently used frame. After the interrupt completes, the 
original application (app) resumes execution. Because the least recently used information was 
25 perturbed by the interrupt, the next block stored into the set by the application evicts its own 
block, a block that was resident before the interrupt, and leaves the interrupt' s block in the right 
frame undisturbed. If the application's block was in recent use at the time of the interrupt, it may 
be assessed again, and incur a miss that would not otherwise have occurred. Thus, an interrupt 
may adversely affect an application without evicting one of its blocks. 
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The conservative and simplifying assumptions employed by conventional 
techniques and discussed above, preclude the need to consider least recently used alterations. 
Since an interrupt' s blocks are assumed to fill the entire cache, none of the application's blocks 
are assumed to be resident when it resumes execution so the state of the least recently used 
5 information is immaterial. 

Live Cache Frames 

The conservative and simplifying assumptions employed by conventional 
techniques and discussed above fail to consider the following factors: 
1. Tasks a's blocks may not fill the entire cache; 
10 2. Not all of task a's blocks may be evicted by an interrupt; 

3. Not all evicted blocks may be subsequently accessed by task a; and 

~J 4. Not all evicted and subsequently accessed blocks incur a miss that would not 

O 

5 otherwise have occurred. 

yy 

H The present invention takes these factors into account to provide an improved 

I s s 

O analysis for estimating the additional execution time of a task caused by any potential interrupts. 
A cache frame is defined herein to be "live" at a time t if and only if the cache frame contains a 
block at time t that is accessed in the future without an intervening eviction. FIG. 4 illustrates the 

S 'S s 

B liveliness state of a cache frame during a task's execution. As shown in FIG. 4, three blocks (a, b, 
and c) are inserted into a frame and accessed at various times. The frame is shaded in live time 
periods. An insertion is caused by an initial access. Block a is inserted and accessed at t 0 . It is 
subsequently accessed at t 3 and t 5 . After t 5 block a is not accessed again before block b is 
inserted into the frame at t 9 . Consequently, the frame is live from time t 0 , the first access of 
block a, to time t 5 , the last access of block a. The frame is not live from times t 5 to t 9 because the 
frame does not contain a block during that time that is accessed in the future without an 
25 intervening eviction. Note the situation at time ti 3 . Block a is inserted and accessed, but is not 
accessed in the future. Consequently, the frame does not become live at time ti 3 . 

The present invention recognizes that the eviction of a task's block from a live 
frame by an interrupt causes a future miss that would not otherwise occur and that evictions from 
live frames are the only evictions that cause misses that would not otherwise occur. 



26 
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Achieving a Tight Bound 
For purposes of illustration, a live cache frame distribution generated by an 
adaptive multi-rate voice coder/decoder (AMR) application is studied. It is again assumed that 
the instruction cache is a two-way set associative cache with 128 sets and that a cache miss 

5 requires 12 cycles to service. There are 256 frames in the cache (two frames in each set). FIGS. 
5A and 5B illustrate the distribution of live sets and frames in the cache during the execution of 
the exemplary adaptive multi-rate voice coder/decoder. This distribution was obtained through 
cycle-accurate simulation of an implementation of the SC100 Architecture. FIG. 5A illustrates 
the total number of live frames over time during the execution of the exemplary adaptive multi- 

10 rate voice coder/decoder (time is expressed in processor clocks). For example, there are about 40 
live frames in the cache at the 2xl0 5 cycle point and a maximum of 162 occurs at approximately 

O t he 2.4xl0 5 cycle point. Assume for now that this is the maximum over the AMR's entire 

fjl execution. 

fj Thus, the maximum number of additional misses that an interrupt can possibly 

Q cause in this application and in this cache is 162, regardless of the time at which the interrupt 

51 

» occurs or the number of blocks that the interrupt may cause to be inserted into the cache during 
pi its execution. It is therefore independent of the nature of the interrupt. The actual number of 
2J misses caused does depend on the nature of the interrupt, but 162 is a valid bound independent of 
O the interrupt. The effect of the least recently used alterations discussed above is subsumed in 
20 these results. Because all application blocks in live frames are assumed to be evicted, a block 

evicted due to a least recently used alteration must be in a dead frame, and therefore does not 

affect execution time. 

Thus, the completion of AMR may be delayed a maximum of 1944 cycles beyond 

that consumed by the interrupt' s execution, since there is a maximum of 162 additional misses 
25 caused by the interrupt, and a miss takes 12 cycles (162x12 = 1944). This bound should be 

compared with the 3072 cycles calculated in the simplistic approach, a 37% decrease. Similar 

results were observed for an exemplary Viterbi decoder application. 

FIG. 5B illustrates the distribution of sets containing at least one live frame 

(termed live-sets) in time and by set location. A horizontal line in the graph represents the time 
30 extent of a live-set whose location is its y-axis intercept. The way an application's blocks map to 
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cache sets and its live frame distribution may be studied with such a graph. One observation is 
that the worst case effect of cache interference may be decreased if interrupts may be disabled or 
deferred during code segments in which there is high live frame coexistence (e.g., during peaks 
in FIG. 5A). While this is not always practical, it may be beneficial in some situations. 
5 Thus, a bound on the additional misses incurred by a task via cache interference 

achieved with the above method depends solely on the maximum number of coexistent live 
frames that may be generated by the task. Therefore, nested or multiple interventions do not alter 
the bound. A caveat is that it must be possible to find the maximum number of live frames 
coexistent during the execution of an application under all possible circumstances. Graphs like 
10 those in FIGS. 5 A and 5B may be generated under various circumstances to allow post-execution 
hk analysis of the program, but formidable amounts of data must be recorded and processed to do 
|{ so. The graphs in FIGS. 5 A and 5B can be generated, for example, by analyzing a time-stamped 
log of accesses to the cache components to determine which frames are "live" at a given time, 
W similar to the analysis discussed above in conjunction with FIG. 3. 

|| In a further variation, an application may be simulated for a period of time during 

* which a candidate maximum is continually calculated and updated as required. Generally, a 

O 

R3 greater candidate maximum replaces a lesser candidate maximum during simulation. A 
ffi candidate maximum may be updated or replaced after it is established since the liveliness of a 
S frame at any time depends on a future read with no intervening eviction. A candidate maximum 
20 becomes the true maximum at the end of a simulation when no further updates are possible. 

Negligible data is accumulated in this approach. An algorithm can be generated to determine the 

maximum number of coexistent live frames experienced during a simulation and the point at 

which it occurs upon completion of a simulation. 

The approach of the present invention is independent of the way intervening tasks 
25 map to frames in an instruction cache. Two refinements are now presented that produce a tighter 

bound but depend on this mapping. 

A First Refinement 

The general approach of the present invention assumes that an interrupt evicts the 
blocks in all of the live frames in a cache during its execution. If an interrupts blocks do not 
30 map to a set that contains a live application frame, then the contents of the live application frame 
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cannot be evicted by the interrupt. FIG. 6 illustrates a further variation of the present invention 
where the sets that contain an application's live frames are compared with the sets accessed by an 
interrupt to determine the maximum number of live-frames that may be affected by an interrupt. 

As shown in FIG. 6, two sets 610-A and 610-1 are formed. The elements of set 
5 610-A are the live-sets generated by the application during its execution. The elements of set 
610-1 are the cache sets accessed by the interrupt during its execution. The intersection of the 
two sets 610-A and 610-1 contains the maximum number of live-sets accessed by the interrupt. 
Therefore, the cardinality of the intersection of the sets multiplied by the number of frames in a 
cache set is the maximum number of application misses that may be caused by the interrupt. The 
10 multiplication is necessary because an interrupt may access a set multiple times and evict blocks 
O from multiple live frames in the set. The result is independent of the time at which an interrupt 

5f occurs. 

m 

h A caveat is that the contents of both sets be independent of the data processed by a 

5 task. If data may change the cache sets accessed, the contents of the sets shown in FIG. 6 are data 
p dependent, thereby invalidating the approach. Some sensitivity to data may be accommodated 

O with the following procedure. If data changes the paths taken through an interrupt' s or 

ft! 

ft! application's code and different cache sets are accessed on the different paths (or in the case of 
an application, different live-sets are generated), a set for each possible path is made and a 
^ comprehensive set is generated from their union. This results in two comprehensive sets 
20 (application and interrupt) and the cardinality of their intersection bounds the number of 
additional misses that may be caused by the interrupt. 

It is noted that even if an application's instruction path is dependent on data 
processed, the live-sets generated may not change. This is because blocks on different paths may 
map to sets that are live-sets at some point (the time that it is live is not relevant) or, if they do 
25 not, they may map to sets that do not become live as a consequence (the blocks are read once, on 
their first access). Similarly, blocks accessed on alternate paths in an interrupt' s code may map to 
the same sets or sets already used by the interrupt. 

A Second Refinement 
The first refinement discussed above does not exploit the fact that only an 
30 interrupt 's accesses to the application's frames that are live at the interrupt point may cause 
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future misses. The frames that are live before or after an interrupt point do not influence future 
misses caused by the interrupt. FIG. 5B shows that the live frame pattern varies over time due to 
frame "births" and "deaths." The pattern is constant between accesses and may only change when 
the application accesses the cache. The time periods between application accesses are possible 
5 interrupt points. Recognizing that an interrupt may occur at any time, a refinement may be 
achieved by bounding the effect of an interrupt at each possible interrupt point and then taking 
the maximum of these as the worst-case bound. 
Set Model 

Essentially, the previous set intersection approach is used except that the 
10 cardinality of the intersection of the interrupt set (containing the cache sets it accesses) is taken 
with each of multiple application sets (each set containing its live-sets), one for each possible 
□ interrupt point. The maximum of these is the bound, i.e., the number of application misses that 
f f may be caused by the interrupt if it occurs at the point that causes the maximum number of 
W evictions from live application frames, 
fg Vector Implementation of Set Model 

J. 4 FIG. 7 illustrates a set vector (ISV) 720 that represents the sets 710 accessed by an 

fjl interrupt. In other words, the set vector (ISV) 720 indicates those sets that are live at any point in 
D time. A set-profile 710 is formed that represents the sets accessed over time by an interrupt 
PI during its execution. The set profile 710 is a structure similar to the plot in FIG. 5B, except that 
20 its shaded entries represent sets that are accessed, and not live sets. A set- vector 720 is formed 
from the set-profile 710 that represents all the cache sets accessed by an interrupt, regardless of 
time. The entries of the set-profile 710 are associated with cache sets one-to-one. A shaded 
entry in the set-vector 720 represents a set that is accessed by the interrupt. 

The "application sets" described above in the sub-section entitled "Set Model" are 
25 represented by column vectors in an "application live-set profile" matrix 800 shown in FIG. 8. 
The rows of the application live-set profile matrix 800 represent individual cache sets. The live- 
set profile 800 is the distribution of live-sets in the cache during the execution of the application. 
A column vector 820 represents the set vector of the interrupt. The matrix 800 is similar to FIG. 
5B except that a live-set with one live frame is shaded light gray and one with two live frames is 
30 shaded dark gray. 
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Since the row dimensions of the interrupt set- vector (IS V) 820 and the application 
set profile (AP) 800 are the same (the number of cache sets), the cardinality of the intersection of 
the sets represented by the ISV and a column in the AP 800 is achieved by examining shaded 
elements of each column that are in the same row. As shown in FIG. 8, if an element in the AP 
5 800 is light gray, at most one live frame in the corresponding set may experience an eviction, and 
one is added to the cardinality of the intersection. If the AP element is dark gray, at most two live 
frames in the corresponding set may experience evections, and two is added to the cardinality of 
the intersection. 

The cardinalities of the intersections of the set represented by the ISV with the 
10 sets represented by columns in the AP is found. They each represent the maximum number of 
live frames that may experience evictions caused by an interrupt, at the interrupt point 
O represented by the AP column. The maximum of these cardinalities is the bound and represents 
JK the maximum effect of an interrupt occurring at the worst possible time. 

fl These cardinalities are shown in FIG. 8 under the columns of the live-set profile 

: 5 J 

© 800. For example, if the interrupt occurs during the first time period (the first column), it may 

fli £ 

cause a maximum of seven additional misses. A maximum count (23) results from an interrupt 

5: at the third time point, causing the worst-case effect on the application shown. 

ill 

HI This approach assumes that every block inserted by an interrupt into a live-set that 

Pi contains one live frame evicts the block in that frame. This is conservative if the set contains two 
% or more frames. In that case, the interrupt may not access the live frame. While conservative, 
the approach achieves a tight bound. Methods producing tighter bounds must either track least 
recently used state over the run time of a system (an unrealistic endeavor) or set least recently 
used state at appropriate times. 

When this approach is applied to the exemplary AMR application discussed above 
25 in conjunction with FIGS. 5A and 5B, and an illustrative interrupt which accesses 64 out of 128 
cache sets (i.e., the indexes of the sets it accesses are 0x00 to OxOF) improved results are 
obtained. The result is that this interrupt will cause no more than 48 misses in the AMR 
application regardless of when it occurs. Compare this result to that of the initial simplistic 
approach (256 misses) and the initial live frame analysis (162 misses). These two results are 
30 independent of the nature of the interrupt, so they may be compared with the current result. 
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FIG. 9 is a schematic block diagram showing the architecture of an exemplary 
software developer's computer 900 incorporating features of the present invention. The software 
developer's computer 900 may be embodied as a general purpose computing system, such as the 
general purpose computing system shown in FIG. 9. The software developer's computer 900 
5 includes a processor 910 and related memory, such as a data storage device 920, which may be 
distributed or local. The processor 910 may be embodied as a single processor, or a number of 
local or distributed processors operating in parallel. The data storage device 920 and/or a read 
only memory (ROM) are operable to store one or more instructions, which the processor 910 is 
operable to retrieve, interpret and execute. 
10 As shown in FIG. 9, the data storage device 920 of the software developer's 

computer 900 optionally contains one or more application programs 960 that are being developed 
E or that may be sharing the cache and interfere with the programs 960. It is noted that the 
O application programs 960 being analyzed may be developed and/or executed on a remote 
U machine and merely analyzed on the software developer's computer 900, as would be apparent to 
3$ a person of ordinary skill in the art. In addition, the data storage device 920 includes a task 
W interference bound analysis process 980 incorporating features of the present invention. 
O As is known in the art, the methods and apparatus discussed herein may be 

« J distributed as an article of manufacture that itself comprises a computer readable medium having 
0 computer readable code means embodied thereon. The computer readable program code means is 
SJ) operable, in conjunction with a computer system, to carry out all or some of the steps to perform 
the methods or create the apparatuses discussed herein. The computer readable medium may be a 
recordable medium (e.g., floppy disks, hard drives, compact disks, or memory cards) or may be a 
transmission medium (e.g., a network comprising fiber-optics, the world-wide web, cables, or a 
wireless channel using time-division multiple access, code-division multiple access, or other 
25 radio-frequency channel). Any medium known or developed that can store information suitable 
for use with a computer system may be used. The computer-readable code means is any 
mechanism for allowing a computer to read instructions and data, such as magnetic variations on 
a magnetic media or height variations on the surface of a compact disk. 

The present invention achieves a tight worst-case execution time bound on the 
30 effect of task interference in an instruction cache. The present invention produces a bound that is 
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time independent (i.e., it is valid regardless of when interventions occur). One embodiment 
achieves a bound that is independent of the cache sets accessed by intervening tasks. The bound 
is the maximum live frames coexistent in the analyzed task. 

If the sets accessed by intervening tasks may be determined, a tighter bound is 
5 achieved by using two alternate embodiments. These alternate embodiments are suitable for 
embedded systems in which a limited number of tasks execute and system efficiency is extremely 
important, a relatively common situation. 

The present invention assumes that the cache sets accessed by an analyzed task 
may be determined via simulation. This may not be practical in some situations, e.g., if many 
10 different set access patterns are possible, and they do not all occur during extensive simulation. 
However, it is sometimes sufficient or prudent to establish a "pseudo-bound," one that is rarely 
exceeded, instead of one that is never exceeded. It may be wasteful to design a system that 
0 accommodates infrequent situations, especially if they are responsible for a large increase in a 
P bound. This is often true in communications systems. 

Sf It is to be understood that the embodiments and variations shown and described 

81 herein are merely illustrative of the principles of this invention and that various modifications 
p may be implemented by those skilled in the art without departing from the scope and spirit of the 
Li invention. 
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